Corpus-based Synthesis of F0 Conto Using the Generation P

Authors

  • Keikichi Hirose
  • Toshiya Katsura
  • Nobuaki Minematsu
Abstract

Corpus-based generation of fundamental frequency (F0) contours was realized for emotional speech synthesis. The method, originally developed for read speech, predicts the command values of the F0 contour generation process model from the linguistic information of the sentence to be synthesized. Since the generated F0 contour is constrained by the model, the synthesized speech retains a certain quality even when the prediction is poor. The speech corpus used for the F0 contour generation experiments includes three types of emotional speech (anger, joy, sadness) and calm speech, all uttered by a female narrator. The command values needed to train and evaluate the method were extracted automatically using a program developed by the authors. We also applied the method to predict segmental durations. The mismatches between the predicted and target contours/durations for emotional speech were similar to those for calm speech.
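
To make the idea of "command values" concrete, the following is a minimal sketch of how an F0 contour can be computed from phrase and accent commands. It assumes the commonly cited form of the generation process model (often called the Fujisaki model), in which log F0 is a baseline plus phrase and accent components; the abstract itself does not give these equations. The function names, the time constants `alpha`, `beta`, `gamma`, and the example command values are all illustrative assumptions, not values taken from the paper.

```python
import math

def phrase_response(t, alpha=3.0):
    """Impulse response of the phrase control mechanism (zero before onset)."""
    return alpha * alpha * t * math.exp(-alpha * t) if t >= 0 else 0.0

def accent_response(t, beta=20.0, gamma=0.9):
    """Step response of the accent control mechanism, capped at gamma."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)

def f0_contour(times, fb, phrase_cmds, accent_cmds,
               alpha=3.0, beta=20.0, gamma=0.9):
    """Return F0 in Hz at each time (seconds) in `times`.

    fb          : baseline frequency in Hz
    phrase_cmds : list of (onset_time, magnitude) impulses
    accent_cmds : list of (onset_time, offset_time, amplitude) steps
    """
    contour = []
    for t in times:
        # The model is additive in the log-frequency domain.
        log_f0 = math.log(fb)
        for t0, ap in phrase_cmds:
            log_f0 += ap * phrase_response(t - t0, alpha)
        for t1, t2, aa in accent_cmds:
            log_f0 += aa * (accent_response(t - t1, beta, gamma)
                            - accent_response(t - t2, beta, gamma))
        contour.append(math.exp(log_f0))
    return contour

# Hypothetical commands: one phrase command and one accent command.
times = [i * 0.01 for i in range(200)]
f0 = f0_contour(times, fb=180.0,
                phrase_cmds=[(0.0, 0.4)],
                accent_cmds=[(0.3, 0.8, 0.3)])
```

In a corpus-based setup of the kind the abstract describes, a predictor trained on the corpus would supply the timings and magnitudes in `phrase_cmds` and `accent_cmds`. Because the contour is always produced by these smooth response functions, a badly predicted command still yields a smooth, speech-like curve, which is the model constraint the abstract refers to.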

Similar resources

Realization of Prosodic Focuses in Corpus-based Generation of Fundamental Frequency Contours of Japanese Based on the Generation Process Model

A method was developed for generating sentence F0 contours of Japanese when a focus is placed on one of the “bunsetsu” of an utterance. It controls F0 based on the F0 model rather than by frame-by-frame F0 prediction as in HMM-based speech synthesis. The method first predicts differences in the F0 model commands between utterances with and without focus, and then applies them to the F0 model ...

Corpus-based generation of prosodic features from text based on generation process model

A total scheme of generating prosodic features from a text input was constructed. The method consists of corpus-based prediction of pauses, phone durations and fundamental frequencies (F0's), in this order, and information predicted in an earlier process is utilized in the following processes. Since prediction of F0's is done on the command values of F0 contour generation process model instead ...

Corpus-based synthesis of fundamental frequency contours of Japanese using automatically-generated prosodic corpus and generation process model

We have been developing corpus-based synthesis of fundamental frequency (F0) contours for Japanese. Since, in our method, the synthesis is done under the constraint of F0 contour generation process model, a rather good quality is still kept even if the prediction process is done poorly. Although it was already shown that the synthesized F0 contours sounded as highly natural as those using heuri...

Improvement in corpus-based generation of F0 contours using generation process model for emotional speech synthesis

In our fully automatic corpus-based method of generating fundamental frequency (F0) contours for emotional speech synthesis, an improvement was realized related to the process of corpus preparation. The method assumes the generation process model and predicts its command parameters using binary regression trees with inputs of linguistic information of the sentence to be synthesized. Because of ...

Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model

A corpus-based method of generating fundamental frequency (F0) contours of various speaking styles from text was developed. Instead of directly predicting F0 values, the method predicts command values of the F0 contour generation process model. Because of the model constraint, the resulting F0 contour keeps certain naturalness even when the prediction is done incorrectly. The method includes a ...

Publication date: 2003